
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: RADIN, DAVID N.
CRAMER, CAROLE L.
OISHI, KAREN
WEISSENBORN, DEBORAH L.

(ii) TITLE OF INVENTION: PRODUCTION OF LYSOSOMAL ENZYMES IN PLANT-BASED EXPRESSION SYSTEMS

(iii) NUMBER OF SEQUENCES: 12

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Pennie & Edmonds
(B) STREET: 1155 Avenue of the Americas
(C) CITY: New York
(D) STATE: New York
(E) COUNTRY: USA
(F) ZIP: 10036-2711

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 60/003,737
(B) FILING DATE: 14-SEP-1995

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Coruzzi, Laura A.
(B) REGISTRATION NUMBER: 30,742
(C) REFERENCE/DOCKET NUMBER: 7956-0011-999

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (212) 790-9090
(B) TELEFAX: (212) 869-9741
(C) TELEX: 660141 PENNIE

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "PCR primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
TTGTCTAGAG TAAGCATCAT GGCTCGC 27

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "PCR primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
CACGAATTCT GGCGACGCCA CAGGTAGGTG TGA 33

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1642 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
ATGGAGTTTT CAAGTCCTTC CAGAGAGGAA TGTCCCAAGC CTTTGAGTAG GGTAAGCATC 60
ATGGCTGGCA GCCTCACAGG TTTCCTTCTA CTTCAGGCAG TGTCGTGGGC ATCAGGTGCC 120
CGCCCCTGCA TCCCTAAAAG CTTbGGCTAC AGCTCGGTGG TGTGTGTCTG CAATGCCACA 180
TACTGTGACT CCTTTGACCC CCC^ACCTTT CCTGCCCTTG GTACCTTCAG CCGCTATGAG 240
AGTACACGCA GTGGGCGACG GATGCGGCTG AGTATGGGGC CCATCCAGGC TAATCACACG 300
GGCACAGGCC TGCTACTGAC CCTGCAGCCA GAACAGAAGT TCCAGAAAGT GAAGGGATTT 360
GGAGGGGCCA TGACAGATGC TGClfccTCTC AACATCCTTG CCCTGTCACC CCCTGCCCAA 420
AATTTGCTAC TTAAATCGTA CTTCTCTGAA GAAGGAATCG GATATAACAT CATCCGGGTA 480
CCCATGGCCA GCTGTGACTT CTCCATCCGC ACCTACACCT ATGCAGACAC CCCTGATGAT 540
TTCCAGTTGC ACAACTTCAG CCTCCCAGAG GAAGATACCA AGCTCAAGAT ACCCCTGATT 600
CACCGAGCCC TGCAGTTGGC CCAGCGTCCC GTTTCACTCC TTGCCAGCCC CTGGACATCA 660
CCCACTTGGC TCAAGACCAA TGGAGCGGTG AATGGGAAGG GGTCACTCAA GGGACAGCCC 720
GGAGACATCT ACCACCAGAC CTGCcicAGA TACTTTGTGA AGTTCCTGGA TGCCTATGCT 780
GAGCACAAGT TACAGTTCTG GGCAGTGACA GCTGAAAATG AGCCTTCTGC TGGGCTGTTG 840
AGTGGATACC CCTTCCAGTG CCTGGGCTTC ACCCCTGAAC ATCAGCGAGA CTTCATTGCC 900
CGTGACCTAG GTCCTACCCT CGCCAACAGT ACTCACCACA ATGTCCGCCT ACTCATGCTG 960
GATGACCAAC GCTTGCTGCT GCCCCACTGG GCAAAGGTGG TACTGACAGA CCCAGAAGCA 1020
GCTAAATATG TTCATGGCAT TGCTGTACAT TGGTACCTGG ACTTTCTGGC TCCAGCCAAA 1080
GCCACCCTAG GGGAGACACA CCGCCTGTTC CCCAACACCA TGCTCTTTGC CTCAGAGGCC 1140
TGTGTGGGCT CCAAGTTCTG GGAGCAGAGT GTGCGGCTAG GCTCCTGGGA TCGAGGGATG 1200
CAGTACAGCC ACAGCATCAT CACGAACCTC CTGTACCATG TGGTCGGCTG GACCGACTGG 1260
AACCTTGCCC TGAACCCCGA AGGAGGACCC AATTGGGTGC GTAACTTTGT CGACAGTCCC 1320
ATCATTGTAG ACGTCACCAG GGACACGTTT TACAAACAGC CCATGTTCTA CCACCTTGGC 1380
CACTTCAGCA AGTTCATTCC TGAGGGCTCC CAGAGAGTGG GGCTGGTTGC CAGTCAGAAG 1440
AACGACCTGG ACGCAGTGGC ACTGATGCAT CCCGATGGCT CTGCTGTTGT GGTCGTGCTA 1500
AACCGCTCCT CTAAGGATGT GCCTCTTACC ATCAAGGATC CTGCTGTGGG CTTCCTGGAG 1560
ACAATCTCAC CTGGCTACTC CATTCACACC TACCTGTGGC GTCGCCAGAA TTCGGACTAC 1620
AAGGACGACG ATGACAAGTT GA 1642

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 546 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:



Met Glu Phe Ser Ser Pro Ser Arg Glu Glu Cys Pro Lys Pro Leu Ser
1 5 10 15

Arg Val Ser Ile Met Ala Gly Ser Leu Thr Gly Leu Leu Leu Leu Gln
20 25 30

Ala Val Ser Trp Ala Ser Gly Ala Arg Pro Cys Ile Pro Lys Ser Phe
35 40 45

Gly Tyr Ser Ser Val Val Cys Val Cys Asn Ala Thr Tyr Cys Asp Ser
50 55 60

Phe Asp Pro Pro Thr Phe Pro Ala Leu Gly Thr Phe Ser Arg Tyr Glu
65 70 75 80

Ser Thr Arg Ser Gly Arg Arg Met Glu Leu Ser Met Gly Pro Ile Gln
85 90 95

Ala Asn His Thr Gly Thr Gly Leu Leu Leu Thr Leu Gln Pro Glu Gln
100 105 110

Lys Phe Gln Lys Val Lys Gly Phe Gly Gly Ala Met Thr Asp Ala Ala
115 120 125

Ala Leu Asn Ile Leu Ala Leu Ser Pro Pro Ala Gln Asn Leu Leu Leu
130 135 140

Lys Ser Tyr Phe Ser Glu Glu Gly Ile Gly Tyr Asn Ile Ile Arg Val
145 150 155 160

Pro Met Ala Ser Cys Asp Phe Ser Ile Arg Thr Tyr Thr Tyr Ala Asp
165 170 175

Thr Pro Asp Asp Phe Gln Leu His Asn Phe Ser Leu Pro Glu Glu Asp
180 185 190

Thr Lys Leu Lys Ile Pro Leu Ile His Arg Ala Leu Gln Leu Ala Gln
195 200 205

Arg Pro Val Ser Leu Leu Ala Ser Pro Trp Thr Ser Pro Thr Trp Leu
210 215 220

Lys Thr Asn Gly Ala Val Asn Gly Lys Gly Ser Leu Lys Gly Gln Pro
225 230 235 240

Gly Asp Ile Tyr His Gln Thr Trp Ala Arg Tyr Phe Val Lys Phe Leu
245 250 255

Asp Ala Tyr Ala Glu His Lys Leu Gln Phe Trp Ala Val Thr Ala Glu
260 265 270

Asn Glu Pro Ser Ala Gly Leu Leu Ser Gly Tyr Pro Phe Gln Cys Leu
275 280 285

Gly Phe Thr Pro Glu His Gln Arg Asp Phe Ile Ala Arg Asp Leu Gly
290 295 300

Pro Thr Leu Ala Asn Ser Thr His His Asn Val Arg Leu Leu Met Leu
305 310 315 320

Asp Asp Gln Arg Leu Leu Leu Pro His Trp Ala Lys Val Val Leu Thr
325 330 335

Asp Pro Glu Ala Ala Lys Tyr Val His Gly Ile Ala Val His Trp Tyr
340 345 350

Leu Asp Phe Leu Ala Pro Ala Lys Ala Thr Leu Gly Glu Thr His Arg
355 360 365

Leu Phe Pro Asn Thr Met Leu Phe Ala Ser Glu Ala Cys Val Gly Ser
370 375 380

Lys Phe Trp Glu Gln Ser Val Arg Leu Gly Ser Trp Asp Arg Gly Met
385 390 395 400

Gln Tyr Ser His Ser Ile Ile Thr Asn Leu Leu Tyr His Val Val Gly
405 410 415

Trp Thr Asp Trp Asn Leu Ala Leu Asn Pro Glu Gly Gly Pro Asn Trp
420 425 430

Val Arg Asn Phe Val Asp Ser Pro Ile Ile Val Asp Val Thr Lys Asp
435 440 445

Thr Phe Tyr Lys Gln Pro Met Phe Tyr His Leu Gly His Phe Ser Lys
450 455 460

Phe Ile Pro Glu Gly Ser Gln Arg Val Gly Leu Val Ala Ser Gln Lys
465 470 475 480

Asn Asp Leu Asp Ala Val Ala Leu Met His Pro Asp Gly Ser Ala Val
485 490 495

Val Val Val Leu Asn Arg Ser Ser Lys Asp Val Pro Leu Thr Ile Lys
500 505 510

Asp Pro Ala Val Gly Phe Leu Glu Thr Ile Ser Pro Gly Tyr Ser Ile
515 520 525

His Thr Tyr Leu Trp Arg Arg Gln Asn Ser Asp Tyr Lys Asp Asp Asp
530 535 540

Asp Lys
545

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 463 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "MeGA Promoter"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
CAATACGATA TTACCGAATA VTATACTAAA TCAAAATTTA ATTTATCATA TCAATTATTA 60
AACTGATATT TCAAATTTTA J^TATTTAATA TCTACTTTCA ACTATTATTA CCTAATTATC 120
AAATGCAAAA TGTATGAGTT ATTTCATAAT AGCCCAGTTC GTATCCAAAT ATTTTACACT 180
TGACCAGTCA ACTTGACTAT ATAAAACTTT ACTTCAAAAA ATTAAAAAAA AAAGAAAGTA 240
TATTATTGTA AAAGATAATA ::TCCATTCAA AATATAAAAT GAAAAAAGTC CAGCGCGGCA 300
ACCGGGTTCC TATAAATACA CTTCCTACAT CTTCTCTTCT CCTCACATCC CATCACTCTT 360
CTTTTAACAA TTATACTTGT TAATCATCAA TCCCACAAAC AACACTTTTT CTCTCCTCTT 420
TTTCCTCACC GGCGGCAGAC TTACCGGTGA AAGTAAGCAG STC 463

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "PCR primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
CTAGTCTAGA ATGCGTCCCC TGCGCCCCCG CG

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "PCR primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
GGAATTCGAG CTCTCATGGA TTGCCCGGGG ATG
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2067 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
ATGCGTCCCC TGCGCCCCCG CGCCGCGCTG CTGGCGCTCC TGGCCTCGCT CCTGGCCGCG 60
CCCCCGGTGG CCCCGGCCGA GGCCCCGCAC CTGGTGCAGG TGGACGCGGC CCGCGCGCTG 120
TGGCCCCTGC GGCGCTTCTG GAGGAGCACA GGCTTCTGCC CCCCGCTGCC ACACAGCCAG 180
GCTGACCAGT ACGTCCTCAG CTGGGACCAG CAGCTCAACC TCGCCTATGT GGGCGCCGTC 240
CCTCACCGCG GCATCAAGCA GGTCCGGACC CACTGGCTGC TGGAGCTTGT CACCACCAGG 300
GGGTCCACTG GACGGGGCCT GAGCTACAAC TTCACCCACC TGGACGGGTA CTTGGACCTT 360
CTCAGGGAGA ACCAGCTCCT CCCAGGGTTT GAGCTGATGG GCAGCGCCTC GGGCCACTTC 420
ACTGACTTTG AGGACAAGCA GCAGCTGTTT GAGTGGAAGG ACTTGGTCTC CAGCCTGGCC 480
AGGAGATACA TCGGTAGGTA CGGACTGGCG CATGTTTCCA AGTGGAACTT CGAGACGTGG 540
AATGAGCCAG ACCACCACGA CTTTGACAAC GTCTCCATGA CCATGCAAGG CTTCCTGAAC 600
TACTACGATG CCTGCTCGGA GGGTCTGCGC GCCGCCAGCC CCGCCCTGCG GCTGGGAGGC 660
CCCGGCGACT CCTTCCACAC CCCACCGCGA TCCCCGCTGA GCTGGGGCCT CCTGCGCCAC 720
TGCCACGACG GTACCAACTT CTTCACTGGG GAGGCGGGCG TGCGGCTGGA CTACATCTCC 780
CTCCACAGGA AGGGTGCGCG CAGCTCCATC TCCATCCTGG AGCAGGAGAA GGTCGTCGCG 840
CACGAGATCC GGCAGCTCTT CCCCAAGTTC GCGGACACCC CCATTTACAA CGACGAGGCG 900
GACCCGCTGG TGGGCTGGTC CCTGCCACAG CCGTGGAGGG CGGACGTGAC CTACGCGGCC 960
ATGGTGGTGA AGGTCATCGC GCAGCATCAG AACCTGCTAC TGGCCAACAC CACCTCCGCC 1020
TTCCCCTACG CGCTCCTGAG CAACGACAAT GCCTTCCTGA GCTACCACCC GCACCCCTTC 1080
GCGCAGCGCA CGCTCACCGC GCGCTTCCAG GTCAACAACA CCCGCCCGCC GCACGTGCAG 1140
CTGTTGCGCA AGCCGGTGCT CACGGCCATG GGGCTGCTGG CGCTGCTGGA TGAGGAGCAG 1200
CTCTGGGCCG AAGTGTCGCA GGCCGGGACC GTCCTGGACA GCAACCACAC GGTGGGCGTC 1260
CTGGCCAGCG CCCACCGCCC CCAGGGCCCG GCCGACGCCT GGCGCGCCGC GGTGCTGATC 1320
TACGCGAGCG ACGACACCCG CGCCCACCCC AACCGCAGCG TCGCGGTGAC CCTGCGGCTG 1380
CGCGGGGTGC CCCCCGGCCC GGGCCTGGTC TACGTCACGC GCTACCTGGA CAACGGGCTC 1440
TGCAGCCCCG ACGGCGAGTG GCGGCGCCTG GGCCGGCCCG TCTTCCCCAC GGCAGAGCAG 1500
TTCCGGCGCA TGCGCGCGGC TGAGGACCCG GTGGCCGCGG CGCCCCGCCC CTTACCCGCC 1560
GGCGGCCGCC TGACCCTGCG CCCCCCGCTG CGGCTGCCGT CGCTTTTGCT GGTGCACGTG 1620
TGTGCGCGCC CCGAGAAGCC GCCCCGGCAG GTCACGCGGC TCCGCGCCCT GCCCCTGACC 1680
CAAGGGCAGC TGGTTCTGGT CTGGTCGGAT GAACACGTGG GCTCCAAGTG CCTGTGGACA 1740
TACGAGATCC AGTTCTCTCA GGACGGTAAG GCGTACACCC CGGTCAGCAG GAAGCCATCG 1800
ACCTTCAACC TCTTTGTGTT CAGCCCAGAC ACAGGTGCTG TCTCTGGCTC CTACCGAGTT 1860
CGAGCCCTGG ACTACTGGGC CCGACCAGGC CCCTTCTCGG ACCCTGTGCC GTACCTGGAG 1920
GTCCCTGTGC CAAGAGGGCC CCCACCCCCG GGCAATCCAT GAGCCTGTGC TGAGCCCCAG 1980
TGGGTTGCAC CTCCACCGGC AGTCAGCGAG CTGGGGCTGC ACTGTGCCCA TGCTGCCCTC 2040
CCATCACCCC CTTTGCAATA TATTTTT 2067

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 653 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Arg Pro Leu Arg Pro Arg Ala Ala Leu Leu Ala Leu Leu Ala Ser
1 5 10 15

Leu Leu Ala Ala Pro Pro Val Ala Pro Ala Glu Ala Pro His Leu Val
20 25 30

His Val Asp Ala Ala Arg Ala Leu Trp Pro Leu Arg Arg Phe Trp Arg
35 40 45

Ser Thr Gly Phe Cys Pro Pro Leu Pro His Ser Gln Ala Asp Gln Tyr
50 55 60

Val Leu Ser Trp Asp Gln Gln Leu Asn Leu Ala Tyr Val Gly Ala Val
65 70 75 80

Pro His Arg Gly Ile Lys Gln Val Arg Thr His Trp Leu Leu Glu Leu
85 90 95

Val Thr Thr Arg Gly Ser Thr Gly Arg Gly Leu Ser Tyr Asn Phe Thr
100 105 110

His Leu Asp Gly Thr Leu Asp Leu Leu Arg Glu Asn Gln Leu Leu Pro
115 120 125

Gly Phe Glu Leu Met Gly Ser Ala Ser Gly His Phe Thr Asp Phe Glu
130 135 140

Asp Lys Gln Gln Val Phe Glu Trp Lys Asp Leu Val Ser Ser Leu Ala
145 150 155 160

Arg Arg Tyr Ile Gly Arg Tyr Gly Leu Ala His Val Ser Lys Trp Asn
165 170 175

Phe Glu Thr Trp Asn Glu Pro Asp His His Asp Phe Asp Asn Val Ser
180 185 190

Met Thr Met Gln Gly Phe Leu Asn Tyr Tyr Asp Ala Cys Ser Glu Gly
195 200 205

Leu Arg Ala Ala Ser Pro Ala Leu Arg Leu Gly Gly Pro Gly Asp Ser
210 215 220

Phe His Thr Pro Pro Arg Ser Pro Leu Ser Trp Gly Leu Leu Arg His
225 230 235 240

Cys His Asp Gly Thr Asn Phe Phe Thr Gly Glu Ala Gly Val Arg Leu
245 250 255

Asp Tyr Ile Ser Leu His Arg Lys Gly Ala Arg Ser Ser Ile Ser Ile
260 265 270

Leu Glu Gln Glu Lys Val Val Ala Gln Glu Ile Arg Gln Leu Phe Pro
275 280 285

Lys Phe Ala Asp Thr Pro Ile Tyr Asn Asp Glu Ala Asp Pro Leu Val
290 295 300

Gly Trp Ser Leu Pro Gln Pro Trp Arg Ala Asp Val Thr Tyr Ala Ala
305 310 315 320

Met Val Val Lys Val Ile Ala Gln His Gln Asn Leu Leu Leu Ala Asn
325 330 335

Thr Thr Ser Ala Phe Pro Tyr Ala Leu Leu Ser Asn Asp Asn Ala Phe
340 345 350

Leu Ser Tyr His Pro His Pro Phe Ala Gln Arg Thr Leu Thr Ala Arg
355 360 365

Phe Gln Val Asn Asn Thr Arg Pro Pro His Val Gln Leu Leu Arg Lys
370 375 380

Pro Val Leu Thr Ala Met Gly Leu Leu Ala Leu Leu Asp Glu Glu Gln
385 390 395 400

Leu Trp Ala Glu Val Ser Gln Ala Gly Thr Val Leu Asp Ser Asn His
405 410 415

Thr Val Gly Val Leu Ala Ser Ala His Arg Pro Gln Gly Pro Ala Asp
420 425 430

Ala Trp Arg Ala Ala Val Leu Ile Tyr Ala Ser Asp Asp Thr Arg Ala
435 440 445

His Pro Asn Arg Ser Val Ala Val Thr Leu Arg Leu Arg Gly Val Pro
450 455 460

Pro Gly Pro Gly Leu Val Tyr Val Thr Arg Tyr Leu Asp Asn Gly Leu
465 470 475 480

Cys Ser Pro Asp Gly Glu Trp Arg Arg Leu Gly Arg Pro Val Phe Pro
485 490 495

Thr Ala Glu Gln Phe Arg Arg Met Arg Ala Ala Glu Asp Pro Val Ala
500 505 510

Ala Ala Pro Arg Pro Leu Pro Ala Gly Gly Arg Leu Thr Leu Arg Pro
515 520 525

Ala Leu Arg Leu Pro Ser Leu Leu Leu Val His Val Cys Ala Arg Pro
530 535 540

Glu Lys Pro Pro Gly Gln Val Thr Arg Leu Arg Ala Leu Pro Leu Thr
545 550 555 560

Gln Gly Gln Leu Val Leu Val Trp Ser Asp Glu His Val Gly Ser Lys
565 570 575

Cys Leu Trp Thr Tyr Glu Ile Gln Phe Ser Gln Asp Gly Lys Ala Tyr
580 585 590

Thr Pro Val Ser Arg Lys Pro Ser Thr Phe Asn Leu Phe Val Phe Ser
595 600 605

Pro Asp Thr Gly Ala Val Ser Gly Ser Tyr Arg Val Arg Ala Leu Asp
610 615 620

Tyr Trp Ala Arg Pro Gly Pro Phe Ser Asp Pro Val Pro Tyr Leu Glu
625 630 635 640

Val Pro Val Pro Arg Gly Pro Pro Ser Pro Gly Asn Pro
645 650

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Asp Tyr Lys Asp Asp Asp Asp Lys
1 5

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "PCR primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GCCTATGCTG AGCACAAGTT ACAG 24

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "Complementary sequence of a PCR primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
TCCCTTGAGC TCGTCACTGG CGACGCCACA GGTA 34